STEIN ESTIMATION FOR THE DRIFT OF GAUSSIAN
PROCESSES USING THE MALLIAVIN CALCULUS

By Nicolas Privault and Anthony Reveillac 

City University of Hong Kong and Université de La Rochelle

We consider the nonparametric functional estimation of the drift
of a Gaussian process via minimax and Bayes estimators. In this
context, we construct superefficient estimators of Stein type for such
drifts using the Malliavin integration by parts formula and superharmonic
functionals on Gaussian space. Our results are illustrated by
numerical simulations and extend the construction of James-Stein
type estimators for Gaussian processes by Berger and Wolpert [J.
Multivariate Anal. 13 (1983) 401-424].

1. Introduction. The maximum likelihood estimator $\hat\mu$ of the mean $\mu\in\mathbb{R}^d$ of a Gaussian random vector $X$ in $\mathbb{R}^d$ with covariance $\sigma^2 I_{\mathbb{R}^d}$ under a probability $P_\mu$ is well known to be equal to $X$ itself. It is efficient in the sense that it attains the Cramér-Rao bound

$$\sigma^2 d = E_\mu[\|X-\mu\|_d^2] = \inf_Z E_\mu[\|Z-\mu\|_d^2], \qquad \mu\in\mathbb{R}^d,$$

over all unbiased mean estimators $Z$ satisfying $E_\mu[Z]=\mu$, for all $\mu\in\mathbb{R}^d$, where $\|\cdot\|_d$ denotes the Euclidean norm on $\mathbb{R}^d$.

In [5], James and Stein have constructed superefficient estimators for the mean of $X\in\mathbb{R}^d$, of the form

$$\left(1-\frac{(d-2)\sigma^2}{\|X\|_d^2}\right)X,$$

whose risk is lower than the Cramér-Rao bound $\sigma^2 d$ in dimension $d\ge 3$.

The problem of drift estimation for Gaussian processes is of interest in several fields of application. For example, the process $(X_t)_{t\in[0,T]}$ can be interpreted as an observed output signal with decomposition

$$X_t = u_t + X_t^u, \qquad t\in[0,T],$$

where the drift $(u_t)_{t\in[0,T]}$ is viewed as an input signal to be estimated and perturbed by a centered Gaussian noise $(X_t^u)_{t\in[0,T]}$; see, for example, [4], Chapter VII. Such results find applications in, for example, telecommunication (additive Gaussian channels) and finance (identification of market trends).

Berger and Wolpert [2, 11] have constructed estimators of James-Stein type for the drift of a Gaussian process $(X_t)_{t\in[0,T]}$ by applying the James-Stein procedure to the independent Gaussian random variables appearing in the Karhunen-Loève expansion of the process. In this context, $\hat u := (X_t)_{t\in\mathbb{R}_+}$ is seen as a minimax estimator of its own drift $(u_t)_{t\in\mathbb{R}_+}$.

Stein [10] has shown that the James-Stein estimators on $\mathbb{R}^d$ could be extended to a wider family of estimators, using integration by parts for Gaussian measures. Let us briefly recall Stein's argument, which relies on integration by parts with respect to the Gaussian density and on the properties of superharmonic functionals for the Laplacian on $\mathbb{R}^d$. Given an estimator of $\mu\in\mathbb{R}^d$ of the form $X+g(X)$, where $g:\mathbb{R}^d\to\mathbb{R}^d$ is sufficiently smooth, and applying the integration by parts formula

$$E_\mu[(X_i-\mu_i)g_i(X)] = \sigma^2 E_\mu[\partial_i g_i(X)] \tag{1.1}$$

to $g=\sigma^2\nabla\log f=\sigma^2(\partial_1\log f,\ldots,\partial_d\log f)$, one obtains

$$E_\mu[\|X+\sigma^2\nabla\log f(X)-\mu\|_d^2] = \sigma^2 d + 4\sigma^4\sum_{i=1}^d E_\mu\left[\frac{\partial_i^2\sqrt{f}(X)}{\sqrt{f}(X)}\right],$$

that is, $X+\sigma^2\nabla\log f(X)$ is a superefficient estimator if

$$\sum_{i=1}^d\partial_i^2\sqrt{f}(x)<0, \qquad dx\text{-a.e.},$$

which is possible if $d\ge 3$. In this case, $X+\sigma^2\nabla\log f(X)$ improves in the mean square sense over the efficient estimator $\hat\mu$ which attains the Cramér-Rao bound $\sigma^2 d$ on unbiased estimators of $\mu$.
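The finite-dimensional statement above can be checked numerically. The following is a minimal Monte Carlo sketch, not taken from the paper: it assumes the classical James-Stein choice $f(x)=\|x\|_d^{2-d}$, for which $X+\sigma^2\nabla\log f(X)=(1-(d-2)\sigma^2/\|X\|_d^2)X$, and the function name `simulate_risks` and its parameters are hypothetical.

```python
import random


def simulate_risks(mu, sigma=1.0, n_samples=20000, seed=0):
    """Monte Carlo estimates of E||est - mu||^2 for est = X and for the
    James-Stein estimator (1 - (d-2) sigma^2 / ||X||^2) X.

    Illustrative sketch only; requires d = len(mu) >= 3.
    """
    rng = random.Random(seed)
    d = len(mu)
    risk_mle = 0.0
    risk_js = 0.0
    for _ in range(n_samples):
        # Draw X ~ N(mu, sigma^2 I_d).
        x = [m + sigma * rng.gauss(0.0, 1.0) for m in mu]
        norm2 = sum(v * v for v in x)
        # James-Stein correction factor applied to X.
        shrink = 1.0 - (d - 2) * sigma ** 2 / norm2
        risk_mle += sum((v - m) ** 2 for v, m in zip(x, mu))
        risk_js += sum((shrink * v - m) ** 2 for v, m in zip(x, mu))
    return risk_mle / n_samples, risk_js / n_samples


if __name__ == "__main__":
    mle, js = simulate_risks([1.0] * 10)
    print(f"risk of X: {mle:.3f}, risk of James-Stein: {js:.3f}")
```

In dimension $d=10$ with $\sigma=1$ the first value should be close to the Cramér-Rao bound $\sigma^2 d=10$, and the second should be visibly smaller.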

In this paper we present an extension of Stein's argument to an infinite-dimensional setting using the Malliavin integration by parts formula, with application to the construction of Stein type estimators for the drift of a Gaussian process $(X_t)_{t\in[0,T]}$. Our approach applies to Gaussian processes such as Volterra processes and fractional Brownian motions. It also extends the results of Berger and Wolpert [2] in the same way that the construction of Stein [10] extends that of James and Stein [5], and this allows us to recover the estimators of the James-Stein type introduced by Berger and Wolpert [2] as particular cases. Here we replace the Stein equation (1.1) with the integration by parts formula of the Malliavin calculus on Gaussian space. Our estimators are given by processes of the form

$$X_t + D_t\log F, \qquad t\in[0,T],$$

where $F$ is a positive superharmonic random variable on Gaussian space and $D_t$ is the Malliavin derivative indexed by $t\in[0,T]$. In contrast to the minimax estimator $\hat u$, such estimators are not only biased but also anticipating with respect to the Brownian filtration $(\mathcal{F}_t)_{t\in[0,T]}$. This, however, poses no problem when one has access to complete paths from time $0$ to $T$.

When the variance of $X$ is large it can be shown that the percentage gain of this estimator is at least equal to the universal constant

$$\frac{16}{\pi^4}\int_{\mathbb{R}^4}\frac{e^{-(x^2+y^2+z^2+r^2)/2}}{x^2+9y^2+25z^2+49r^2}\,dx\,dy\,dz\,dr, \tag{1.2}$$

which approximately represents 11.38%; see (4.4) below.

We proceed as follows. Section 2 deals with notation and preliminaries on the representation of Gaussian processes. In Section 3 we show that the estimator $\hat u := (X_t)_{t\in[0,T]}$ is minimax, and in the case of independent increments we prove a Cramér-Rao bound over all unbiased drift estimators, which is attained by $\hat u$. Then we turn to the construction of superefficient drift estimators using the Malliavin gradient operator, which show, as in the finite-dimensional case, that the minimax estimator $\hat u$ is not admissible. We also construct nonnegative superharmonic functionals using cylindrical functionals on Gaussian space and show that the James-Stein estimators of Berger and Wolpert [2] can be recovered as particular cases of our approach. Examples and numerical simulations for the gain of such estimators are presented in Section 4 for the estimation of a deterministic drift. Proofs of the main results are provided in Section 5 using the Girsanov theorem, functional Bayes estimators and the Malliavin calculus.

2. Notation. Let $T>0$. Consider a real-valued centered Gaussian process $(X_t)_{t\in[0,T]}$ with covariance function

$$\gamma(s,t)=E[X_sX_t], \qquad s,t\in[0,T],$$

on a probability space $(\Omega,\mathcal{F},P)$, where $\mathcal{F}$ is the $\sigma$-algebra generated by $X$.

We choose to represent $(X_t)_{t\in[0,T]}$ as an isonormal Gaussian process on the real separable Hilbert space $H$ generated by the functions $\chi_t(s)=\min(s,t)$, $s,t\in[0,T]$, with the scalar product $\langle\cdot,\cdot\rangle_H$ and norm $\|\cdot\|_H$ defined by

$$\langle\chi_t,\chi_s\rangle_H := \gamma(s,t), \qquad s,t\in[0,T].$$

In this case $X$ is viewed as an isometry $X:H\to L^2(\Omega,P)$ with

$$X(\chi_t) := X_t, \qquad t\in[0,T],$$

and $\{X(h) : h\in H\}$ is a family of centered Gaussian random variables satisfying

$$E[X(h)X(g)] = \langle h,g\rangle_H, \qquad h,g\in H.$$

We will assume in addition that $\gamma(s,t)$ has the form

$$\gamma(s,t)=\int_0^{s\wedge t}K(t,r)K(s,r)\,dr, \qquad s,t\in[0,T],$$

where $K(\cdot,\cdot)$ is a deterministic kernel and

$$(Kh)(t) := \int_0^t K(t,s)h(s)\,ds$$

is differentiable in $t\in[0,T]$. In this case the scalar product in $H$ satisfies

$$\langle h,g\rangle_H = \langle K^*h,K^*g\rangle = \langle h,\Gamma g\rangle,$$

where $\Gamma = KK^*$ and $K^*$ is the adjoint of $K$ with respect to

$$\langle h,g\rangle := \langle h,g\rangle_{L^2([0,T],dt)};$$

see [1]. Moreover, for any orthonormal basis $(h_k)_{k\in\mathbb{N}}$ of $H$, we have the expansion

$$X_t=\sum_{k=0}^\infty\langle\chi_t,h_k\rangle_H X(h_k)=\sum_{k=0}^\infty\langle 1_{[0,t]},\Gamma h_k\rangle_{L^2([0,T],dt)}X(h_k)=\sum_{k=0}^\infty\Gamma h_k(t)X(h_k),$$

$t\in[0,T]$, and the representation

$$X_t=\int_0^t K(t,s)\,dW_s, \qquad t\in[0,T],$$

where $(W_s)_{s\in[0,T]}$ is a standard Brownian motion; see [1].

Note that there exist other constructions of Gaussian processes leading to different series decompositions for $X_t$. For example, Berger and Wolpert [2, 11] use Karhunen-Loève expansions; see [8] for a formulation of our results in that setting.

3. Main results. 

Efficient drift estimator. Recall that the classical linear parametric estimation problem for the drift of a diffusion consists in estimating the coefficient $\theta$ appearing in

$$d\xi_t=\theta a_t(\xi_t)\,dt+dY_t, \qquad \xi_0=0,$$

with a maximum likelihood estimator $\hat\theta_T$ given by

$$\hat\theta_T=\frac{\int_0^T a_t(\xi_t)\,d\xi_t}{\int_0^T a_t^2(\xi_t)\,dt}; \tag{3.1}$$

see [6, 9] in case $(Y_t)_{t\in\mathbb{R}_+}$ is a Brownian motion.

In this paper we consider the nonparametric functional estimation of the drift of a one-dimensional Gaussian process $(X_t)_{t\in\mathbb{R}_+}$ with decomposition

$$dX_t=\dot u_t\,dt+dX_t^u, \tag{3.2}$$

where $(u_t)_{t\in[0,T]}$ is an adapted process of the form

$$u_t=\int_0^t\dot u_s\,ds, \qquad t\in[0,T], \quad \dot u\in L^2(\Omega\times[0,T]),$$

and $(X_t^u)_{t\in\mathbb{R}_+}$ is a centered Gaussian process under a probability $P_u$ which is the translation of $P$ on $\Omega$ by $u$. The expectation under $P_u$ will be denoted by $E_u$.

Definition 3.1. A drift estimator $\xi$ is called unbiased if

$$E_u[\xi_t]=E_u[u_t], \qquad t\in[0,T],$$

for all square-integrable adapted processes $(u_t)_{t\in[0,T]}$. It is called adapted if the process $(\xi_t)_{t\in[0,T]}$ is adapted to the filtration $(\mathcal{F}_t)_{t\in[0,T]}$ generated by $(X_t)_{t\in[0,T]}$.

Here, the canonical process $(X_t)_{t\in[0,T]}$ will be considered as an unbiased estimator of its drift $(u_t)_{t\in[0,T]}$ under $P_u$, with risk defined as

$$R(\gamma,\mu,\hat u) := E_u[\|X-u\|^2_{L^2([0,T],d\mu)}] = \int_0^T E_u[|X_t^u|^2]\,\mu(dt) = \int_0^T\gamma(t,t)\,\mu(dt),$$

where $\mu$ is a finite weighting Borel measure on $[0,T]$. In case $u$ is constrained to have the form $u_t=\theta t$, $t\in[0,T]$, $\theta\in\mathbb{R}$, our estimator $\hat u$ satisfies $\hat u_T=\hat\theta_T T$, where $\hat\theta_T$ is given by (3.1), $T>0$, with the asymptotics $\hat\theta_T\to\theta$ in probability as $T$ tends to infinity.

In the next proposition we note that, as in the finite-dimensional Gaussian case, the estimator $\hat u=(X_t)_{t\in[0,T]}$ is minimax. Note that no adaptedness condition is imposed on $\xi$ in the infimum (3.3).

Proposition 3.2. The estimator $\hat u=(X_t)_{t\in[0,T]}$ is minimax. For all $u\in\Omega$, we have

$$R(\gamma,\mu,\hat u)=E_u\left[\int_0^T|X_t-u_t|^2\,\mu(dt)\right]=\inf_\xi\sup_{v\in\Omega}E_v\left[\int_0^T|\xi_t-v_t|^2\,\mu(dt)\right]. \tag{3.3}$$

This proposition is proved in Section 5 using functional Bayes estimators.

We close this section with a more precise statement concerning the efficiency of the estimator $\hat u=(X_t)_{t\in[0,T]}$ in the case where $(X_t)_{t\in[0,T]}$ has independent increments, that is, $\gamma(s,t)$ is of the form

$$\gamma(s,t)=\int_0^{s\wedge t}\sigma_u^2\,du,$$

where $\sigma\in L^2([0,T],dt)$ is an a.e. nonvanishing function with

$$E\left[\int_0^T\frac{\dot u_s^2}{\sigma_s^2}\,ds\right]<\infty. \tag{3.4}$$

In this case the following proposition allows us to compute a Cramér-Rao bound attained by $\hat u$, and shows that $\hat u=(X_t)_{t\in[0,T]}$ is an efficient estimator.



Proposition 3.3 (Cramér-Rao inequality). For any unbiased and adapted estimator $\xi$ of $u$ we have

$$E_u\left[\int_0^T|\xi_t-u_t|^2\,\mu(dt)\right]\ge R(\sigma,\mu,\hat u), \tag{3.5}$$

where $u\in L^2(\Omega\times[0,T],P_u\otimes\mu)$ is adapted and the Cramér-Rao type bound

$$R(\sigma,\mu,\hat u) := \int_0^T\int_0^t\sigma_s^2\,ds\,\mu(dt)$$

is independent of $u$ and is attained by the efficient estimator $\hat u=X$.

This proposition is proved in Section 5 using the Girsanov theorem.
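As a quick worked instance of this bound (a sketch under the additional assumptions, used later in Section 4, that $\sigma_s=\sigma$ is constant and $\mu(dt)=dt$), the double integral reduces to an elementary computation:

```latex
\[
  R(\sigma,\mu,\hat u)
  = \int_0^T \int_0^t \sigma^2 \, ds \, dt
  = \sigma^2 \int_0^T t \, dt
  = \frac{\sigma^2 T^2}{2}.
\]
```

This is the value of the risk of $\hat u = X$ against which the gain of the Stein type estimators is measured in the Brownian case.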

Superefficient drift estimators. Next we construct a family of superefficient estimators of $u$ of the form $X+\xi$, whose mean square error is strictly smaller than the minimax risk $R(\gamma,\mu,\hat u)$ of Proposition 3.2 when $\xi\in L^2([0,T]\times\Omega,P_u\otimes\mu)$ is a suitably chosen stochastic process. This estimator will be biased and anticipating with respect to the Brownian filtration. In the next theorem we follow Stein's argument which uses integration by parts, but we replace (1.1) by the duality relation (5.5) between the gradient and divergence operators on Gaussian space.

Before turning to the main result, we need to introduce some elements of analysis and Malliavin calculus on Gaussian space; see, for example, [7]. Fix $(h_n)_{n\ge1}$ a total subset of $H$, and let $\mathcal{S}$ denote the space of cylindrical functionals of the form

$$F=f_n(X^u(h_1),\ldots,X^u(h_n)), \tag{3.6}$$

where $f_n$ is in the space of infinitely differentiable rapidly decreasing functions on $\mathbb{R}^n$, $n\ge1$.

Definition 3.4. The Malliavin derivative $D$ is defined as

$$D_tF=\sum_{i=1}^n\Gamma h_i(t)\,\partial_if_n(X^u(h_1),\ldots,X^u(h_n)), \qquad t\in[0,T],$$

for $F\in\mathcal{S}$ of the form (3.6).

It is known that $D$ is closable (cf. Proposition 1.2.1 of [7]) and its closed domain will be denoted by $\mathrm{Dom}(D)$.

Definition 3.5. We define the Laplacian $\Delta$ by

$$\Delta F=\mathrm{trace}_{L^2([0,T],d\mu)^{\otimes2}}DDF=\int_0^TD_tD_tF\,\mu(dt)$$

on the space $\mathrm{Dom}(\Delta)$ made of all $F\in\mathrm{Dom}(D)$ such that $D_tF\in\mathrm{Dom}(D)$, $t\in[0,T]$, and $(D_tD_tF)_{t\in[0,T]}\in L^2([0,T],d\mu)$, $P$-a.s.

For $F\in\mathcal{S}$ of the form (3.6), we have

$$\Delta F=\sum_{i,j=1}^n\langle\Gamma h_i,\Gamma h_j\rangle_{L^2([0,T],d\mu)}\,\partial_i\partial_jf_n(X^u(h_1),\ldots,X^u(h_n)),$$

and it can be easily shown that the operator $\Delta$ is closable; see, for example, [8]. We will say that a random variable $F$ in $\mathrm{Dom}(\Delta)$ is $\Delta$-superharmonic on $\Omega$ if

$$\Delta F(\omega)\le0, \qquad P(d\omega)\text{-a.s.} \tag{3.7}$$

The next theorem is our main result on the construction of superefficient estimators. It is proved in Section 5 using the Malliavin calculus on Gaussian space.

Theorem 3.6. (i) Unbiased risk estimate. For any $\xi\in L^2(\Omega\times[0,T],P_u\otimes\mu)$ such that $\xi_t\in\mathrm{Dom}(D)$, $t\in[0,T]$, and $(D_t\xi_t)_{t\in[0,T]}\in L^1(\Omega\times[0,T],P_u\otimes\mu)$, we have

$$E_u[\|X+\xi-u\|^2_{L^2([0,T],d\mu)}]=R(\gamma,\mu,\hat u)+\|\xi\|^2_{L^2(\Omega\times[0,T],P_u\otimes\mu)}+2E_u\left[\int_0^TD_t\xi_t\,\mu(dt)\right]. \tag{3.8}$$

(ii) Stein-type estimator. For any $P$-a.s. positive random variable $F\in\mathrm{Dom}(D)$ such that $D_tF\in\mathrm{Dom}(D)$, $t\in[0,T]$, and $(D_tD_tF)_{t\in[0,T]}\in L^1(\Omega\times[0,T],P_u\otimes\mu)$, we have

$$E_u[\|X+D\log F-u\|^2_{L^2([0,T],d\mu)}]=R(\gamma,\mu,\hat u)-E_u[\|D\log F\|^2_{L^2([0,T],d\mu)}]+2E_u\left[\frac{\Delta F}{F}\right]=R(\gamma,\mu,\hat u)+4E_u\left[\frac{\Delta\sqrt F}{\sqrt F}\right]. \tag{3.9}$$

Remarks.

(a) The $\Delta$-superharmonicity of $F$ is sufficient but not necessary for $X+D\log F$ to be superefficient, namely, it can be replaced by the $\Delta$-superharmonicity of $\sqrt F$, which is a weaker assumption; see [3] in the finite-dimensional case.

(b) As in [10], the superefficient estimators constructed in this way are minimax in the sense that, from Proposition 3.2 and Theorem 3.6(ii), for all $u\in H$ we have

$$E_u[\|X+D\log F-u\|^2_{L^2([0,T],d\mu)}]<R(\gamma,\mu,\hat u)=\inf_\xi\sup_{v\in\Omega}E_v\left[\int_0^T|\xi_t-v_t|^2\,\mu(dt)\right],$$

provided $\Delta\sqrt F<0$ on a set of strictly positive $P$-measure, thus showing that the minimax estimator $\hat u=(X_t)_{t\in[0,T]}$ is inadmissible.

We now turn to some examples of nonnegative superharmonic functionals with respect to the Laplacian $\Delta$, which satisfy the hypotheses of Theorem 3.6 and will be used in the numerical applications of Section 4. We assume that $(\Gamma h_k)_{k\ge1}$ is orthogonal in $L^2([0,T],d\mu)$, and we let

$$\lambda_k=\|\Gamma h_k\|_{L^2([0,T],d\mu)}, \qquad k\ge1.$$

For example, in case $\mu(dt)=dt$ the sequence $(h_k)_{k\ge1}$ can be realized as the solution of the eigenvalue problem

$$\Gamma h_k=-\lambda_k^2\dot h_k, \qquad h_k(T)=0, \quad k\ge1. \tag{3.10}$$

Superharmonic functionals on Gaussian space can be constructed as cylindrical functionals, by composition with finite-dimensional functions. From the expression

$$\Delta F=\sum_{i=1}^n\lambda_i^2\,\partial_i^2f_n(X^u(h_1),\ldots,X^u(h_n))$$

of $\Delta$ on cylindrical functionals, we check that

$$F=f_n(\lambda_1^{-1}X^u(h_1),\ldots,\lambda_n^{-1}X^u(h_n))$$

is superharmonic on $\Omega$ if and only if $f_n$ is superharmonic on $\mathbb{R}^n$. Note that when $u\in H$ is deterministic, any superharmonic functional of the form

$$f_n(\lambda_1^{-1}X^u(h_1),\ldots,\lambda_n^{-1}X^u(h_n))$$

can be replaced with

$$f_n(\lambda_1^{-1}X(h_1),\ldots,\lambda_n^{-1}X(h_n)),$$

which retains the same harmonicity property since $u$ is deterministic, and can be directly computed from an observation of $X$.

Given $a\in\mathbb{R}$ and $b\in\mathbb{R}^n$, let $f_{n,a,b}:\mathbb{R}^n\to\mathbb{R}$ be defined as

$$f_{n,a,b}(x_1,\ldots,x_n)=\|x+b\|^a=((x_1+b_1)^2+\cdots+(x_n+b_n)^2)^{a/2};$$

then $\sqrt{f_{n,a,b}}$ is superharmonic on $\mathbb{R}^n$, $n\ge3$, if and only if $a\in[4-2n,0]$. Letting

$$F_{n,a,b}=f_{n,a,b}(\lambda_1^{-1}X^u(h_1),\ldots,\lambda_n^{-1}X^u(h_n)),$$

we have

$$D_t\log F_{n,a,b}=a\sum_{i=1}^n\frac{\lambda_i^{-1}\Gamma h_i(t)\,(b_i+\lambda_i^{-1}X^u(h_i))}{|b_1+\lambda_1^{-1}X^u(h_1)|^2+\cdots+|b_n+\lambda_n^{-1}X^u(h_n)|^2}$$

and

$$\Delta\sqrt{F_{n,a,b}}=\sum_{i=1}^n\partial_i^2\sqrt{f_{n,a,b}}\,(\lambda_1^{-1}X^u(h_1),\ldots,\lambda_n^{-1}X^u(h_n)),$$

since $(\Gamma h_k)_{k\ge1}$ is orthogonal in $L^2([0,T],d\mu)$. Hence,

$$\frac{\Delta\sqrt{F_{n,a,b}}}{\sqrt{F_{n,a,b}}}=\frac{a(n-2+a/2)/2}{|b_1+\lambda_1^{-1}X^u(h_1)|^2+\cdots+|b_n+\lambda_n^{-1}X^u(h_n)|^2} \tag{3.11}$$

is negative if $4-2n<a<0$. On the other hand,

$$\frac{\Delta F_{n,a,b}}{F_{n,a,b}}=\frac{a(n+a-2)}{|b_1+\lambda_1^{-1}X^u(h_1)|^2+\cdots+|b_n+\lambda_n^{-1}X^u(h_n)|^2} \tag{3.12}$$

is negative for $a\in(2-n,0)$ and vanishes for $a=2-n$. Taking $b_i=\lambda_i^{-1}\langle u,h_i\rangle$, $i=1,\ldots,n$, we have

$$D_t\log F_{n,2-n,b}=-(n-2)\frac{[\Pi_nX]_t}{\|\Pi_nX\|^2_{L^2([0,T],dt)}},$$

where $\Pi_n$ denotes the orthogonal projection

$$\Pi_nX(t) := \sum_{k=1}^n\lambda_k^{-2}X(h_k)\Gamma h_k(t)=\sum_{k=1}^n\lambda_k^{-1}(b_k+\lambda_k^{-1}X^u(h_k))\Gamma h_k(t).$$

The resulting estimator

$$X_t+D_t\log F_{n,2-n,b}=X_t-(n-2)\frac{[\Pi_nX]_t}{\|\Pi_nX\|^2_{L^2([0,T],dt)}}, \qquad t\in[0,T],$$

is of James-Stein type, but it is not a shrinkage operator.
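The numerators in the formulas for $\Delta\sqrt{F_{n,a,b}}/\sqrt{F_{n,a,b}}$ and $\Delta F_{n,a,b}/F_{n,a,b}$ both come from the Euclidean Laplacian of a power of the norm. The following worked computation is a sketch of that standard step, written with $y=x+b$ and a generic exponent $c$ (notation introduced here only for illustration):

```latex
\[
  \partial_i \|y\|^{c} = c\, y_i \|y\|^{c-2},
  \qquad
  \partial_i^2 \|y\|^{c} = c\, \|y\|^{c-2} + c(c-2)\, y_i^2 \|y\|^{c-4},
\]
\[
  \sum_{i=1}^n \partial_i^2 \|y\|^{c}
  = \bigl( c\,n + c(c-2) \bigr) \|y\|^{c-2}
  = c\,(n + c - 2)\, \|y\|^{c-2}.
\]
```

Taking $c=a$ gives the factor $a(n+a-2)$, which vanishes at $a=2-n$, while taking $c=a/2$ gives $(a/2)(n-2+a/2)$, which is negative exactly when $4-2n<a<0$.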

From (3.12) we have $\Delta F_{n,2-n,b}=0$, hence (3.9) and (3.11) show that

$$\|D\log F_{n,2-n,b}\|^2_{L^2([0,T]\times\Omega,P_u\otimes dt)}=-4E_u\left[\frac{\Delta\sqrt{F_{n,2-n,b}}}{\sqrt{F_{n,2-n,b}}}\right]=(n-2)^2E_u\left[\frac{1}{|\lambda_1^{-1}X(h_1)|^2+\cdots+|\lambda_n^{-1}X(h_n)|^2}\right]=(n-2)^2E_u[\|\Pi_nX\|^{-2}_{L^2([0,T],dt)}], \tag{3.13}$$

and the risk of $X+D\log F_{n,2-n,b}$ is

$$E_u[\|X+D\log F_{n,2-n,b}-u\|^2_{L^2([0,T],dt)}]=R(\gamma,\mu,\hat u)-(n-2)^2E_u[\|\Pi_nX\|^{-2}_{L^2([0,T],dt)}].$$

4. Numerical application. In this section we present numerical simulations which allow us to measure the efficiency of our estimators. We use the superharmonic functionals constructed as cylindrical functionals in the previous section, and we assume that $u\in H$ is deterministic.

We work in the case of independent increments and we additionally assume that $\sigma_t=\sigma$ is constant, $t\in[0,T]$, that is, $(X_t^u)_{t\in[0,T]}$ is a Brownian motion with variance $\sigma^2$, $\frac{d}{dt}\Gamma h(t)=\sigma^2h(t)$, $t\in[0,T]$, and

$$R(\sigma,\mu,\hat u)=\frac{\sigma^2T^2}{2}.$$



Letting

that is,

$$h_n(t)=\frac1\sigma\sqrt{\frac2T}\cos\left(\left(n-\frac12\right)\frac{\pi t}{T}\right), \qquad t\in[0,T], \quad n\ge1,$$

provides an orthonormal basis $(h_n)_{n\ge1}$ of $H$ such that $(\Gamma h_k)_{k\ge1}$ is orthogonal in $L^2([0,T],dt)$, with

$$\lambda_n=\frac{\sigma T}{\pi(n-1/2)}, \qquad n\ge1,$$

and satisfies (3.10). The Stein estimator $X+D\log F$ of $u$ will be given by

$$D_t\log F_{n,2-n,b}=-(n-2)\sqrt{\frac2T}\sum_{k=1}^n\frac{\lambda_k^{-1}X(h_k)}{|\lambda_1^{-1}X(h_1)|^2+\cdots+|\lambda_n^{-1}X(h_n)|^2}\sin\left(\left(k-\frac12\right)\frac{\pi t}{T}\right).$$

For simulation purposes we will represent the (nondrifted) Brownian motion $(X_t^u)_{t\in[0,T]}$ via the Paley-Wiener expansion

$$X_t^u=\sum_{n=1}^\infty\eta_n\Gamma h_n(t)=\sigma\sqrt{2T}\sum_{n=1}^\infty\frac{\sin((n-1/2)\pi t/T)}{\pi(n-1/2)}\,\eta_n, \tag{4.1}$$

where $(\eta_n)_{n\ge1}$ are independent standard Gaussian random variables with unit variance under $P_u$ and

$$\eta_n=\int_0^Th_n(s)\,dX_s^u, \qquad n\ge1.$$

In this case we have

$$D_t\log F_{n,2-n,b}=-(n-2)\sqrt{\frac2T}\sum_{k=1}^n\frac{\lambda_k^{-1}(\eta_k+\langle u,h_k\rangle)}{\sum_{l=1}^n\lambda_l^{-2}(\eta_l+\langle u,h_l\rangle)^2}\sin\left(\left(k-\frac12\right)\frac{\pi t}{T}\right). \tag{4.2}$$

The gain of the superefficient estimator $X+D\log F_{n,2-n,b}$ compared to the minimax and efficient estimator $\hat u$ is given by

$$G(u,\sigma,T,n) := \frac{R(\sigma,\mu,\hat u)-E_u[\|X+D\log F_{n,2-n,b}-u\|^2_{L^2([0,T],dt)}]}{R(\sigma,\mu,\hat u)}, \qquad n\ge3.$$

From (3.9) and (3.13) we have

$$G(u,\sigma,T,n)=2(n-2)^2E_u\left[\left(\sum_{l=1}^n\pi^2\left(l-\frac12\right)^2(\eta_l+\langle u,h_l\rangle)^2\right)^{-1}\right], \tag{4.3}$$

hence,

$$\lim_{\sigma\to\infty}G(u,\sigma,T,n)=(n-2)^2\frac{8}{\pi^2}E\left[\left(\sum_{l=1}^n(2l-1)^2\eta_l^2\right)^{-1}\right]. \tag{4.4}$$

The quantity (4.4) can be evaluated as a Gaussian integral to yield (1.2). Unlike in the classical Stein method, we stress that here $n$ becomes a free parameter and there is some interest in determining the values of $n$ which yield the best performance. The next proposition is proved by standard arguments (see [8] for details), and is illustrated by Figure 2.
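The limiting gain can also be approximated by one-dimensional quadrature instead of a four-dimensional Gaussian integral. The sketch below is not from the paper. It assumes that the limit equals $\frac{8(n-2)^2}{\pi^2}E[(\sum_{l=1}^n(2l-1)^2\eta_l^2)^{-1}]$, as follows from $\lambda_l=\sigma T/(\pi(l-1/2))$ and $R(\sigma,\mu,\hat u)=\sigma^2T^2/2$, and it uses the identity $E[1/Q]=\int_0^\infty E[e^{-sQ}]\,ds$ with $E[e^{-sa\eta^2}]=(1+2as)^{-1/2}$. The function name `limiting_gain` and the step count are hypothetical choices.

```python
import math


def limiting_gain(n, n_steps=100000):
    """Approximate 8 (n-2)^2 / pi^2 * E[1 / sum_{l<=n} (2l-1)^2 eta_l^2]
    for independent standard Gaussian eta_l (illustrative sketch)."""
    if n < 3:
        raise ValueError("n must be at least 3")
    weights = [(2 * l - 1) ** 2 for l in range(1, n + 1)]
    # E[1/Q] = int_0^inf prod_l (1 + 2 a_l s)^(-1/2) ds.
    # With s = t / (1 - t) the range becomes (0, 1) and the integrand is
    # (1 - t)^(n/2 - 2) * prod_l (1 - t + 2 a_l t)^(-1/2).
    h = 1.0 / n_steps
    total = 0.0
    for i in range(n_steps):
        t = (i + 0.5) * h  # midpoint rule, so t = 1 is never evaluated
        prod = 1.0
        for a in weights:
            prod *= 1.0 - t + 2.0 * a * t
        total += (1.0 - t) ** (n / 2.0 - 2.0) / math.sqrt(prod)
    expectation = total * h
    return 8.0 * (n - 2) ** 2 / math.pi ** 2 * expectation


if __name__ == "__main__":
    print(f"limiting gain for n = 4: {100 * limiting_gain(4):.2f}%")
```

For $n=4$ the result should be in line with the figure of roughly 11% quoted in the Introduction. For $n=3$ the transformed integrand has an integrable singularity at $t=1$, so the midpoint rule converges more slowly there.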

Proposition 4.1. For all $\sigma,T>0$, and $u\in H$, we have

$$G(u,\sigma,T,n)\simeq\frac{6}{\pi^2n}$$

as $n$ goes to infinity.

In the sequel we choose $u_t=\alpha t$, $t\in[0,T]$, $\alpha\in\mathbb{R}$. Figure 1 gives a sample path representation of the process $X+D\log F$.

In this case, from (4.3) we have

$$G(\alpha,\sigma,T,n)=2(n-2)^2E\left[\left(\sum_{l=1}^n\left(\pi\left(l-\frac12\right)\eta_l-(-1)^l\frac{\alpha}{\sigma}\sqrt{2T}\right)^2\right)^{-1}\right],$$

from which it follows that $G(\alpha,\sigma,T,n)$ converges to

$$(n-2)^2\frac{8}{\pi^2}E\left[\left(\sum_{l=1}^n(2l-1)^2\eta_l^2\right)^{-1}\right]$$

when $\alpha^{-2}\sigma^2/T$ tends to infinity, and is equivalent to

$$n\left(1-\frac2n\right)^2\frac{\sigma^2}{\alpha^2T}$$

as $\alpha^{-2}\sigma^2/T$ tends to 0. Figure 2 represents the gain in percentage of the superefficient estimator $X+D\log F_{n,2-n,b}$ compared to the efficient estimator $\hat u$ using Monte Carlo simulations, that is, we represent $100\times G(\alpha,\sigma,T,n)$ as a function of $n\ge3$. An optimal value

$$n_{\mathrm{opt}}=\mathrm{argmax}\{G(\alpha,\sigma,T,n) : n\ge3\}$$

of $n$ exists in general and is equal to 4 when $\alpha=\sigma=T=1$. Figure 3 shows the variation of the gain as a function of $n$ and $T$ for $\alpha=\sigma=1$. Figure 4 represents the variation of the gain as a function of n and a.
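The Monte Carlo evaluation of the gain can be sketched in coefficient space, without simulating paths. The code below is an illustration under stated assumptions and is not the authors' simulation code: it takes $u_t=\alpha t$, $\mu(dt)=dt$, $X(h_k)=\eta_k+\langle u,h_k\rangle$ with $\langle u,h_k\rangle=\int_0^Th_k(s)\dot u_s\,ds$, and uses the orthogonality of $(\Gamma h_k)_{k\ge1}$ with $\|\Gamma h_k\|=\lambda_k$, so that only the first $n$ coordinates of $X^u$ are affected by the correction. The name `simulated_gain` and the default $n=6$ (chosen so that the Monte Carlo summands have finite variance) are hypothetical.

```python
import math
import random


def simulated_gain(alpha, sigma=1.0, T=1.0, n=6, n_samples=20000, seed=0):
    """Monte Carlo estimate of the relative risk reduction of
    X + D log F_{n,2-n,b} over X for the drift u_t = alpha * t
    (illustrative sketch, Brownian noise with constant sigma)."""
    rng = random.Random(seed)
    ks = range(1, n + 1)
    # lambda_k = sigma T / (pi (k - 1/2)).
    lam = [sigma * T / (math.pi * (k - 0.5)) for k in ks]
    # <u, h_k> = int_0^T h_k(s) alpha ds for h_k proportional to a cosine.
    drift = [alpha / sigma * math.sqrt(2.0 * T) * (-1) ** (k + 1)
             / (math.pi * (k - 0.5)) for k in ks]
    risk_x = sigma ** 2 * T ** 2 / 2.0  # risk of the estimator X
    total = 0.0
    for _ in range(n_samples):
        eta = [rng.gauss(0.0, 1.0) for _ in ks]
        x = [e + m for e, m in zip(eta, drift)]  # X(h_k)
        s = sum((xk / lk) ** 2 for xk, lk in zip(x, lam))
        for e, xk, lk in zip(eta, x, lam):
            # Coefficient of X^u + D log F along Gamma h_k.
            corrected = e - (n - 2) * xk / (lk ** 2 * s)
            # Squared-error difference on this coordinate.
            total += lk ** 2 * (e ** 2 - corrected ** 2)
    return total / n_samples / risk_x


if __name__ == "__main__":
    for a in (0.0, 1.0, 10.0):
        print(f"alpha = {a}: gain = {100 * simulated_gain(a):.2f}%")
```

The estimated gain is expected to decrease as $|\alpha|$ grows relative to $\sigma/\sqrt T$, in agreement with the two asymptotic regimes stated above.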



5. Proofs of the main results.

Minimaxity. In order to prove the minimaxity of the estimator $\hat u=X$, we will need the notion of Bayes estimator. We will make use of the next lemma which is classical in the framework of Gaussian filtering. We say that a Gaussian process $Z$ has covariance $\Gamma$ and drift $v\in\Omega$ if $Z-v$ is a centered Gaussian process with covariance $\Gamma$.

Lemma 5.1. Let $Z$ be a Gaussian process with covariance operator $\Gamma_\tau$ and drift $v\in H$, and assume that $X$ is a Gaussian process with drift $Z$ and covariance operator $\Gamma$ given $Z$. Then, conditionally to $X$, $Z$ has drift

$$f\mapsto\langle f,(\Gamma+\Gamma_\tau)^{-1}\Gamma v\rangle+X((\Gamma+\Gamma_\tau)^{-1}\Gamma_\tau f)$$

and covariance

$$\Gamma_\tau(\Gamma+\Gamma_\tau)^{-1}\Gamma.$$

Note that, unlike in Proposition 3.3, no adaptedness or unbiasedness restriction is made on $\xi$ in the infimum taken in (5.2) below.
Fig. 3. Gain as a function of n and T.

Proposition 5.2 (Bayes estimator). Let $P_\tau^v$ denote the Gaussian distribution on $\Omega$ with covariance operator $\Gamma_\tau$ and drift $v\in H$. The Bayes risk

$$\int_\Omega E_z\left[\int_0^T|\xi_t-z_t|^2\,\mu(dt)\right]dP_\tau^v(z) \tag{5.1}$$

of any drift estimator $(\xi_t)_{t\in[0,T]}$ on $\Omega$ under the prior distribution $P_\tau^v$ is uniquely minimized by

$$\xi_t^{\tau,v} := \langle\chi_t,(\Gamma_\tau+\Gamma)^{-1}\Gamma v\rangle+X((\Gamma_\tau+\Gamma)^{-1}\Gamma_\tau\chi_t), \qquad t\in[0,T],$$

which has risk

$$\int_0^T\langle\chi_t,\Gamma_\tau(\Gamma_\tau+\Gamma)^{-1}\Gamma\chi_t\rangle\,\mu(dt). \tag{5.2}$$

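Fig. 4. Gain as a function of n and a.

Proof. Let $Z$ denote a Gaussian process with drift $v\in H$ and covariance $\Gamma_\tau$. From Lemma 5.1, $(Z_t)_{t\in[0,T]}$ has drift

$$t\mapsto\langle\chi_t,(\Gamma_\tau+\Gamma)^{-1}\Gamma v\rangle+X((\Gamma_\tau+\Gamma)^{-1}\Gamma_\tau\chi_t), \qquad t\in[0,T],$$

and covariance $\Gamma_\tau(\Gamma_\tau+\Gamma)^{-1}\Gamma$ conditionally to $X$. Hence, the Bayes risk of an estimator $\xi$ under the prior distribution $P_\tau^v$ is given by

$$\int_\Omega E_z\left[\int_0^T|\xi_t-z_t|^2\,\mu(dt)\right]dP_\tau^v(z)=E\left[\int_0^T|\xi_t-Z_t|^2\,\mu(dt)\right]$$

$$=E\left[\int_0^T|\xi_t-E[Z_t\mid X]|^2\,\mu(dt)\right]+E\left[\int_0^T\mathrm{Var}(Z_t\mid X)\,\mu(dt)\right]$$

$$=E\left[\int_0^T|\xi_t-\langle\chi_t,(\Gamma_\tau+\Gamma)^{-1}\Gamma v\rangle-X((\Gamma_\tau+\Gamma)^{-1}\Gamma_\tau\chi_t)|^2\,\mu(dt)\right]+\int_0^T\langle\chi_t,\Gamma_\tau(\Gamma_\tau+\Gamma)^{-1}\Gamma\chi_t\rangle\,\mu(dt),$$

which is minimized by

$$\xi_t^{\tau,v} := E[Z_t\mid X]=\langle\chi_t,(\Gamma_\tau+\Gamma)^{-1}\Gamma v\rangle+X((\Gamma_\tau+\Gamma)^{-1}\Gamma_\tau\chi_t), \qquad t\in[0,T].$$

□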
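For orientation, the one-dimensional analogue of this computation is the usual Gaussian conjugate update. The sketch below uses scalar stand-ins $\tau^2$ and $\sigma^2$ for $\Gamma_\tau$ and $\Gamma$ (an illustrative simplification, not part of the proof): if $Z\sim N(v,\tau^2)$ and $X\mid Z\sim N(Z,\sigma^2)$, then

```latex
\[
  E[Z \mid X] = \frac{\sigma^2}{\sigma^2+\tau^2}\, v + \frac{\tau^2}{\sigma^2+\tau^2}\, X,
  \qquad
  \mathrm{Var}(Z \mid X) = \frac{\tau^2 \sigma^2}{\sigma^2+\tau^2}
  \;\longrightarrow\; \sigma^2
  \quad \text{as } \tau \to \infty .
\]
```

The posterior variance increases to the risk $\sigma^2$ of the estimator $X$ as the prior becomes diffuse, which is the mechanism used in the proof of Proposition 3.2.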



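Clearly, $\xi^{\tau,v}$ is unique in the sense that it is the only estimator which minimizes the Bayes risk (5.1). This shows, in particular, that every $\xi^{\tau,v}$ is admissible in the sense that if an estimator $\xi$ satisfies

$$E_z[\|\xi-z\|^2_{L^2([0,T],d\mu)}]\le E_z[\|\xi^{\tau,v}-z\|^2_{L^2([0,T],d\mu)}], \qquad z\in\Omega,$$

then

$$\int_\Omega E_z[\|\xi-z\|^2_{L^2([0,T],d\mu)}]\,dP_\tau^v(z)\le\int_0^T\langle\chi_t,\Gamma_\tau(\Gamma_\tau+\Gamma)^{-1}\Gamma\chi_t\rangle\,\mu(dt),$$

hence,

$$\int_\Omega E_z[\|\xi-z\|^2_{L^2([0,T],d\mu)}]\,dP_\tau^v(z)=\int_0^T\langle\chi_t,\Gamma_\tau(\Gamma_\tau+\Gamma)^{-1}\Gamma\chi_t\rangle\,\mu(dt), \tag{5.3}$$

and $\xi=\xi^{\tau,v}$ by Proposition 5.2.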

Proof of Proposition 3.2. We apply Proposition 5.2 in the case $\Gamma_\tau f(t)=\tau^2f(t)$, $t\in[0,T]$. Clearly, taking $\xi=X$ yields

$$R(\gamma,\mu,\hat u)=\sup_{u\in\Omega}E_u\left[\int_0^T|X_t-u_t|^2\,\mu(dt)\right]\ge\inf_\xi\sup_{u\in\Omega}E_u\left[\int_0^T|\xi_t-u_t|^2\,\mu(dt)\right].$$

On the other hand, from Proposition 5.2, for all processes $\xi$ we have

$$\sup_{u\in\Omega}E_u\left[\int_0^T|\xi_t-u_t|^2\,\mu(dt)\right]\ge\int_\Omega E_z\left[\int_0^T|\xi_t-z_t|^2\,\mu(dt)\right]dP_\tau(z)\ge\int_0^T\langle\chi_t,(I+\Gamma/\tau^2)^{-1}\Gamma\chi_t\rangle\,\mu(dt),$$

for all $\tau>0$, hence,

$$\inf_\xi\sup_{u\in\Omega}E_u\left[\int_0^T|\xi_t-u_t|^2\,\mu(dt)\right]\ge\int_0^T\langle\chi_t,\Gamma\chi_t\rangle\,\mu(dt)=R(\gamma,\mu,\hat u).$$

□



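Cramér-Rao bound.

Proof of Proposition 3.3 (Cramér-Rao inequality). By the Girsanov theorem, $P_u$ is absolutely continuous with respect to $P$ under the condition (3.4), with

$$dP_u=\Lambda(u)\,dP,$$

where

$$\Lambda(u) := \exp\left(\int_0^T\frac{\dot u_s}{\sigma_s^2}\,dX_s-\frac12\int_0^T\frac{\dot u_s^2}{\sigma_s^2}\,ds\right)$$

denotes the Girsanov-Cameron-Martin density, and the canonical process $(X_t)_{t\in[0,T]}$ becomes a continuous Gaussian semimartingale under $P_u$, with quadratic variation $\sigma_t^2\,dt$ and drift $\dot u_t\,dt$.

Since $\xi$ is unbiased, for all $\zeta\in H$, we have

$$E_{u+\varepsilon\zeta}[\xi_t]=E_{u+\varepsilon\zeta}[u_t+\varepsilon\zeta_t]=E_{u+\varepsilon\zeta}[u_t]+\varepsilon E_{u+\varepsilon\zeta}[\zeta_t]=E_{u+\varepsilon\zeta}[u_t]+\varepsilon\zeta_t, \qquad t\in[0,T], \quad \varepsilon\in\mathbb{R},$$

hence,

$$\zeta_t=\frac{d}{d\varepsilon}E_{u+\varepsilon\zeta}[\xi_t-u_t]\Big|_{\varepsilon=0}=\frac{d}{d\varepsilon}E[(\xi_t-u_t)\Lambda(u+\varepsilon\zeta)]\Big|_{\varepsilon=0}=E\left[(\xi_t-u_t)\frac{d}{d\varepsilon}\Lambda(u+\varepsilon\zeta)\Big|_{\varepsilon=0}\right]$$

$$=E_u\left[(\xi_t-u_t)\frac{d}{d\varepsilon}\log\Lambda(u+\varepsilon\zeta)\Big|_{\varepsilon=0}\right]=E_u\left[(\xi_t-u_t)\left(\int_0^T\frac{\dot\zeta_s}{\sigma_s^2}\,dX_s-\int_0^T\frac{\dot\zeta_s\dot u_s}{\sigma_s^2}\,ds\right)\right]=E_u\left[(\xi_t-u_t)\int_0^T\frac{\dot\zeta_s}{\sigma_s^2}\,dX_s^u\right],$$

where the exchange between expectation and derivative is justified by classical uniform integrability arguments. Since $\xi_t-u_t$ is $\mathcal{F}_t$-measurable, only the part of the stochastic integral up to time $t$ contributes to the last expectation. Thus, by the Cauchy-Schwarz inequality and the Itô isometry, we have

$$\zeta_t^2\le E_u\left[\left|\int_0^t\frac{\dot\zeta_s}{\sigma_s^2}\,dX_s^u\right|^2\right]E_u[|\xi_t-u_t|^2]=\int_0^t\frac{\dot\zeta_s^2}{\sigma_s^2}\,ds\;E_u[|\xi_t-u_t|^2], \qquad t\in[0,T].$$

It then suffices to take

$$\zeta_t=\int_0^t\sigma_s^2\,ds, \qquad t\in[0,T],$$

to get

$$\mathrm{Var}_u[\xi_t]=E_u[|\xi_t-u_t|^2]\ge\int_0^t\sigma_s^2\,ds, \qquad t\in[0,T], \tag{5.4}$$

which leads to (3.5) after integration with respect to $\mu(dt)$. As noted above, $\hat u=(X_t)_{t\in[0,T]}$ is clearly unbiased under $P_u$ and it attains the lower bound $R(\sigma,\mu,\hat u)$. □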



Superefficient drift estimators.

Proof of Theorem 3.6. Let $\delta:L^2(\Omega;H,P_u)\to L^2(\Omega,P_u)$ denote the closable adjoint of $D$, that is, the divergence operator under $P_u$, which satisfies the integration by parts formula

$$E_u[F\delta(v)]=E_u[\langle v,DF\rangle], \qquad F\in\mathrm{Dom}(D), \quad v\in\mathrm{Dom}(\delta), \tag{5.5}$$

with the relation

$$\delta(hF)=FX^u(h)-\langle h,DF\rangle;$$

see [7], for $F\in\mathrm{Dom}(D)$ and $h\in H$ such that $hF\in\mathrm{Dom}(\delta)$. Note that (5.5) is an infinite-dimensional version of the integration by parts (1.1), which can be proved using, for example, the countable sequence of Gaussian random variables appearing in the Paley-Wiener expansion (4.1) of $X$.

(i) We have

$$E_u[\|X+\xi-u\|^2_{L^2([0,T],d\mu)}]=E_u\left[\int_0^T|X_t^u+\xi_t|^2\,\mu(dt)\right]$$

$$=E_u\left[\int_0^T|X_t^u|^2\,\mu(dt)\right]+\|\xi\|^2_{L^2(\Omega\times[0,T],P_u\otimes\mu)}+2E_u\left[\int_0^T\xi_tX_t^u\,\mu(dt)\right]$$

$$=R(\gamma,\mu,\hat u)+\|\xi\|^2_{L^2(\Omega\times[0,T],P_u\otimes\mu)}+2E_u\left[\int_0^T\xi_tX_t^u\,\mu(dt)\right],$$

and

$$E_u[\xi_tX_t^u]=E_u[\xi_tX^u(\chi_t)]=E_u[\xi_t\delta(\chi_t)]=E_u[\langle\chi_t,D\xi_t\rangle]=E_u[D_t\xi_t], \qquad t\in[0,T],$$

hence, (3.8) holds.

(ii) Next, we specialize the above argument to processes $\xi$ of the form

$$\xi_t=D_t\log F, \qquad t\in[0,T],$$

where $F$ is an a.s. strictly positive and sufficiently smooth random variable. From (3.8) we have

$$E_u[\|X+D\log F-u\|^2_{L^2([0,T],d\mu)}]=R(\gamma,\mu,\hat u)+\|D\log F\|^2_{L^2(\Omega\times[0,T],P_u\otimes\mu)}+2E_u\left[\int_0^TD_tD_t\log F\,\mu(dt)\right]$$

$$=R(\gamma,\mu,\hat u)+E_u\left[\int_0^T\left(\left|\frac{D_tF}{F}\right|^2+2D_tD_t\log F\right)\mu(dt)\right],$$

and we use the relation

$$\left|\frac{D_tF}{F}\right|^2+2D_tD_t\log F=2\frac{D_tD_tF}{F}-\left|\frac{D_tF}{F}\right|^2, \qquad t\in[0,T].$$

Concerning the last equality, we note that

$$\frac{D_tD_t\sqrt F}{\sqrt F}=\frac{1}{\sqrt F}D_t\left(\frac{D_tF}{2\sqrt F}\right)=\frac{D_tD_tF}{2F}-\frac14\left|\frac{D_tF}{F}\right|^2, \qquad t\in[0,T],$$

which implies

$$\frac{\Delta\sqrt F}{\sqrt F}=\frac{\Delta F}{2F}-\frac14\int_0^T|D_t\log F|^2\,\mu(dt), \tag{5.6}$$

and allows us to conclude from (i). □